Early Access: The content on this website is provided for informational purposes only in connection with pre-General Availability Qlik Products.
All content is subject to change and is provided without warranty.
Skip to main content

Change data partitioning on Amazon EMR

When Change Data Partitioning is enabled, the Replicate Change Tables in Hive are partitioned by the partition_name column. Data files are uploaded to Amazon S3 storage, according to the maximum size and time definition, and then stored in a directory under the Change Table directory. Whenever the specified partition timeframe ends, a partition is created in Hive, pointing to the Amazon S3 storage.

Information about the partitions is written to the attrep_cdc_partitions Control Table.

 

Did this page help you?

If you find any issues with this page or its content – a typo, a missing step, or a technical error – let us know how we can improve!